IRC-CLVul: Cross-Programming-Language Vulnerability Detection with Intermediate Representations and Combined Features
نویسندگان
چکیده
The most severe problem in cross-programming languages is feature extraction due to different tokens programming languages. To solve this problem, we propose a cross-programming-language vulnerability detection method paper, IRC-CLVul, based on intermediate representation and combined features. Specifically, first converted programs into unified LLVM (LLVM-IR) provide classification basis for Afterwards, extracted the code sequences control flow graphs of samples, used semantic model extract program information graph structure information, concatenated them vectors. Finally, Random Forest learn vectors obtained results. We conducted experiments 85,811 samples from Juliet test suite C, C++, Java. results show that our improved accuracy by 7% compared with two baseline algorithms, F1 score showed 12% increase.
منابع مشابه
(CLSCR) Cross Language Source Code Reuse Detection Using Intermediate Language
In today's digital era information access is just a click away. so computer science students also have easy access to all the source codes from different websites thus it has become difficult for academicians to detect source code reuse in students programming assignments. The new trend in the area of source code reuse is using the source code by translating it in another programming language p...
متن کاملIntermediate integer programming representations using value disjunctions
We introduce a general technique to create an extended formulation of a mixed-integer program. We classify the integer variables into blocks, each of which generates a finite set of vector values. The extended formulation is constructed by creating a new binary variable for each generated value. Initial experiments show that the extended formulation can have a more compact complete description ...
متن کاملProgramming Language Features for Refinement
module A { type T function F(x: T): T function Twice(x: T): T { F(F(x)) } } module B refines A { type T = T’ datatype T’ = Leaf(int) | Node(T, T) function F... { match x case Leaf(w) => Leaf(w+1) case Node(left, right) => Node(F(left), F(right)) } } Figure 0. Two example modules, one (B) declared as a refinement of the other (A). • Refine previously given specifications (for example, strengthen...
متن کاملEnglish Event Detection With Translated Language Features
We propose novel radical features from automatic translation for event extraction. Event detection is a complex language processing task for which it is expensive to collect training data, making generalisation challenging. We derive meaningful subword features from automatic translations into target language. Results suggest this method is particularly useful when using languages with writing ...
متن کاملCross-Language Prominence Detection
We explore the ability to perform automatic prosodic analysis in one language using models trained on another. If we are successful, we should be able to identify prosodic elements in a language for which little or no prosodically labeled training data is available, using models trained on a language for which such training data exists. Given the laborious nature of manual prosodic annotation, ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Electronics
سال: 2023
ISSN: ['2079-9292']
DOI: https://doi.org/10.3390/electronics12143067